Factor Analysis Segmentation and Classification in Broadcast News Domain
نویسندگان
چکیده
This paper proposes a study of a Factor Analysis (FA) segmentation and classification system. Our approach is inspired by language recognition systems where every input sequence is a language. Following this idea, a study between the classic segmentation systems based on HMM/GMM and FA is done over the output of a perfect segmentation system (oracle boundaries). It can be seen how FA improves the classification results compared to HMM/GMM. Also, the first experiments of an on-building FA segmentation system are reported suggesting the need to improve the channel compensation over some classes.
منابع مشابه
Speaker role based structural classification of broadcast news stories
This paper is concerned with automatic classification of broadcast news stories based on speaker roles such as anchor, reporter and others. The story classification is the first step for many related tasks such as browsing, indexing, and summarising the news broadcast. We use broadcast news audio and its automatic speech recogniser transcripts to implement the classification system. It builds o...
متن کاملAdvances in automatic transcription of Italian broadcast news
This paper presents some recent improvements in automatic transcription of Italian broadcast news obtained at ITCirst. A first preliminary activity was carried out in order to develop a suitable speech corpus for the Italian language. The resulting corpus, formed by recordings covering 30 hours of radio news, was exploited for developing a baseline system for transcription of broadcast news. Th...
متن کاملAudio segmentation, classification and clustering in a broadcast news task
This paper describes our work on the development of an audio segmentation, classification and clustering system applied to a Broadcast News task for the European Portuguese language. We developed a new algorithm for audio segmentation that is both accurate and uses less computational resources than other approaches. Our speaker clustering module uses a modified BIC algorithm which performs subs...
متن کاملSegmentation, Classification and Clustering of an Italian Broadcast News Corpus
This work reports on preliminary activity at ITC-irst on the problem of acoustic segmentation, classification and clustering of an Italian audio broadcast news corpus. The approach is based on the following stages. First, the input data stream is segmented by detecting spectral changes through the Bayesian Information Criterion (BIC). Second, segments are classified in terms of acoustic conditi...
متن کاملSegment Generation and Clustering in the HTK Broadcast News Transcription System
This paper describes the segmentation, gender detection and segment clustering scheme used in the 1997 HTK broadcast news evaluation system and presents results on both the unpartitioned 1996 development and the 1997 evaluation sets. The stages of our approach are presented, namely classification, segmentation and gender detection, gender relabelling, and clustering of speech segments. The eval...
متن کامل